Site integration management system for operational support service in an internet data center

ABSTRACT

A management system for an Internet Data Center, comprising an Information Technology and Network Management System IT&amp;N for managing the Data Center, a Building Management System BMS for monitoring facilities related to the IDC such as access control, fire protection and intruder protection, and a Facility Management System FMS for maintaining geographical views and information about equipment, furniture and building space. The BMS is connected to the IT&amp;N for transmitting event data whenever an event occurs, such as when a parameter associated with the event leaves a predetermined range. The IT&amp;N comprises an Operational Support Service OSS for processing event data, defining actions resulting from the processing and transmitting commands to the BMS and/or to the IT&amp;N in response to the defined actions.

This application is a divisional of Ser. No. 10/234,065; filed on Sep.3, 2002 now U.S. Pat. No. 6,980,978.

FIELD OF THE INVENTION

The present invention relates to Internet Data Centers (IDC) which offerto customers a combination of solid and secure high speed networking anddata center services including servers, storage and network transport,and relates in particular to a management system for an Internet DataCenter.

BACKGROUND

The management of an Internet Data Center (IDC) is generally achieved byseveral entities being functionally and physically different and havingno relationship between them: an Information Technology and NetworkingManagement System (IT&N), a Building Management System (BMS) and aFacility Management System (FMS).

The BMS monitors all systems related to the site such as access control,fire protection, and intruder protection. Specific drivers can integratespecific equipment such as Uninterruptible Power Systems (UPS), chillersor other equipment that is part of the IDC Site installation. A BMS is astructured, organized and integrated system that easily supervises theentire IDC site infrastructure and thus provides a unified management.It integrates and manages all the Site related available data andverifies that their values are in predefined ranges. It may launchcorrective actions and may provide emergency services in case ofdetected problems. The user can easily participate in the decisions byaccessing a collection of documents related to all site systemmalfunctions such as geographical location, alarm lists and texts, andgraphical value views of each detector.

The FMS supervises all needs related to the Site asset management andfacility descriptions. It maintains up to date all geographical viewsand information about equipment (IT and non IT), furniture and buildingspace. With an FMS, an organization may use one source for all itsfacility information needs. The FMS allows the manipulation of drawings,reports, requests and different layer views with intuitive navigation toany device in any of the drawings and display of detailed informationabout that object. Moreover, FMS is provided with maintenance andinventory functions.

The IT&N is used to administer small and large IT infrastructures. Thisentity has evolved with more specific modules highly targeting theInternet Data Center management requirements. It is divided into twoparts. The first part is the OSS (Operational Support Service), and thesecond part is related to resources such as asset management, network,material, and applications. This implementation is sometimes called IS(Integrated Solutions) because it is a structured, organized andintegrated system that easily supervises the entire IDC infrastructureand thus provides a unified IT system management.

The central point of IT&N is constituted by the Operational SupportServices (OSS) which are key services for a successful implementation ofan IDC. It is this system that gives the personality and the efficiencyto the Internet Data Center, serving as the glue between all themanagement components. It allows a service-oriented businessimplementation of the IDC management platform and creates a competitiveadvantage by managing the Service Delivery through an integrated,management solution. It is today critical to offer very highavailability to a mission-critical infrastructure such as an IDC.Service Level Agreements (SLA/) are becoming key commitments required ina customer's contract, with guaranteed compensation if specified levelsof availability, speed and bandwidth are not met.

An OSS is often built today using IT and Networking management platformssuch as “TIVOLI” (IBM trademark) or “Computer Associate”. In order tointegrate the Site management requirements into the OSS, severalfunctions are generally added. Site management applications such as GPAX(IBM trademark) and Aperture (Aperture trademark) which are capable ofcommunicating with other applications, can be the right addition to theOSS platform. Each application brings new added value, defines skillsand create an effective global management platform where all thedependencies can be covered (Site infrastructure, IT infrastructure andNetworking).

The set of applications selected fulfill the groupware concept for anOSS. However, none comply with the previous integration criteria tocreate a single groupware environment for an improved OSS platform. Fullscalability and easy adaptation to one or several sites requiresspecific capabilities that many site management applications do nothave. Some look simple for small configurations but become very complexin large-scale environments. Others are highly capable of supporting adefined set of requirements but are very poor regarding datacommunication to support the groupware environment. For an IDC, the datatransfer, data storage and data processing must be highly secured andefficient.

Japanese patent JP 07023479 entitled “Building Automation System” (BAS)discloses a method that could be nearly a BMS. It relates to aninstitution management apparatus for multistory building which convertsa point address contained in alarm information into a symbol expressingapparatus, area and floor classification. In large installations, allHVAC components (sensors, counters, contacts, etc.) are connected via aDDC/PMS (Direct Digital Control/Programmable Management System) unit toa BAS and the BAS is connected to the BMS. This patent considersmanagement of only the HVAC components (for example air conditioning, orpower supply equipment). In fact, many BMS manage several subsystemslike HVAC but also access control, intruder alarm system, and fire alarmsystems. A powerful user interface is provided, which gives access toschematics and symbols. The schematics are graphical views (e.g. groundplan of a floor, or the block circuit diagram of an air-conditioningsystem) or displays of field points which are updated dynamically. Thefield point definition determines the position of the detector in theschematic, and which appearance and information it has to have. A fieldpoint can be defined as an active button; clicking on the button bringsup an emergency list, for example, or may be area and floorclassification.

U.S. Pat. No. 4,526,068 discloses a method to collect information aboutvarious parameters such as humidity, and temperature. Theseenvironmental parameters are measured with a portable device andregistered together with information giving the time and location ofeach measurement. The readings are interrelated parameters, e.g.temperature and relative humidity. They can be subjected to correctionssuch as linearization. A keyboard on the device enables theinstantaneous read-out of a selected parameter on a common displaypanel, or the detection of dangerous concentrations of noxious gassesmay set off an alarm.

This U.S. patent as well as the above Japanese patent relate only to onepart of the overall fulfilling requirements for the complete groupwaresolution. Indeed, since each part of the IDC operation is run by adifferent organization with specific roles and competencies, there is nostructured relationship between the various management systems. The SiteFMS has little or even no groupware action on the IT&N. The.interrelations that allow committing and managing the Service LevelAgreement (SLA) are not captured by the FMS, and the BMS thus createsmajor operation risks and increases the cost of establishing the rightavailability of the IDC. This issue is also true between the BMS and theIT&N in regard to the alarms in various fields of the facility and theimpact into the IT operation. This kind of situation reducesinteroperability and creates operational deficiencies, communicationdifficulties and delay problems.

SUMMARY OF THE INVENTION

Accordingly, an object of the invention is to provide a managementsystem enabling the automation of a fully managed and autonomousInternet Data Center (IDC), reducing operational cost and optimizingdecisions and performances in regard to the business engagement for theIDC customers and the related Service Level Agreements (SLA).

Another object of the invention is to provide a management system for anInternet Data Center (IDC) providing a scalable, highly available, fullyautomated environment for the Operational Support Service (OSS),integrating the Facility Management System (FMS) and the BuildingManagement System (BMS) with minimum operator interface and low humanintervention.

The invention relates therefore to a management system of an InternetData Center (IDC) comprising an Information Technology and NetworkManagement System IT&N for managing the Internet Data Center, a BuildingManagement System BMS for monitoring all facilities related to the IDCsuch as access control, fire protection and intruder protection, and aFacility Management System FMS for maintaining up to date geographicalviews and information about equipment, furniture and building space ofthe IDC. The BMS is connected to the IT&N for transmitting thereto eventdata each time an event occurs, such an occurrence being achieved whenone or several parameters associated with the event are out of apredetermined range of values. The IT&N comprises Operational SupportService OSS for processing the event data, defining actions resultingfrom the processing and transmitting commands to the BMS and/or to theIT&N in response to the defined actions.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other objects, features and advantages of the inventionwill be better understood by reading the following more particulardescription of the invention in conjunction with the accompanyingdrawings wherein:

FIG. 1 is a block diagram of an Internet Data Center Management systemaccording to the invention comprising an Information Technology andNetwork Management System (IT&N), a Building Management System (BMS) anda Facility Management System (FMS) and the links there between.

FIG. 2 is a flow chart illustrating the steps of the method used in theinvention to process the event data from the BMS in the OperationalSupport Services of the IT&N.

DETAILED DESCRIPTION

According to the invention, the management system for the Internet DataCenter (IDC) illustrated in FIG. 1 includes an Information Technologyand Networking Management System (IT&N) 10 for managing the IDC, themain part of which being the Operational Support Services (OSS) 12, aFacility Management System (FMS) 14 for maintaining up to dategeographical views and information about equipment, furniture andbuilding space of the IDC, and a Building Management System (BMS) 16 formonitoring facilities related to the IDC such as access control, fireprotection and intruder protection.

The FMS 14 is a visual tool providing several independent functionswhich can be represented by various modules (not shown) such as thefollowing:

-   -   A Furniture Management module which tracks furniture        inventories, both existing and warehouse. It also provides for        space planning and reconfigurations with automatic reporting of        new requirements vs. existing inventories. It enables        determination of which items can be used from existing and        warehouse inventories and what additional items need to be        purchased or what excess inventory needs to be warehoused.    -   An Equipment Management module, which tracks equipment        inventories, financial information on leased vs. owned        equipment, service contracts, warranties, and equipment        capabilities.    -   A “Request system by Intranet” module provides facilities and IT        managers with an automated way to receive, route, delegate, and        manage employee service, equipment, new hire, and move requests        over an Intranet. It is a web-based work request and work flow        management system. It is based on an e-commerce model and is        customizable to fit an organization's particular needs.        Ultimately, it helps to relieve the problems that facilities        managers, IT managers, and their customers meet when making or        managing requests.    -   A Space Management module, which measures easily and accurately        the building, floor, tenant, department, and/or assigned office        space. These measurements are used to charge occupants for space        costs and/or analyze space use efficiencies using a variety of        methods. The Space Management function allows for measurement of        each floor's gross, core, building common, floor common,        department space, circulation, and assigned space, and then        apportions the space to each department, division, and so forth.

In general, the FMS provides four kinds of functionality: “Drawings”,“Stylizer”, “Show” and “Find” functions.

-   -   “Drawings” is a repository of drawings obtained by Auto CAD        document imports or using FMS drawing tools, and may include:        -   Drawings related to each IDC (eventually if several tenant            sites).        -   Drawings related to the building(s): boundaries, and so            forth.        -   Drawings related to space management (if there isn't a            related database): space vacancy, cost space.    -   “Show” is a layer representation. The selected elements appear        on the foreground, such as:        -   Furniture visualization        -   Equipment (Networking system and BMS) visualization        -   BMS alarm visualization        -   Networking system alarm visualization.    -   “Styliser” associates colors to different elements, depending on        a criterion. For example, if the criterion is the date, it may        visualize the more recent interventions in a different color.        For example:        -   Maintenance        -   Space visualizations (if there isn't a related database):            vacancy.    -   “Find” provides a very interesting functionality: to locate an        employee. A reference table associates a name with a location.        For an employee name, it returns an associated location in the        building. This request system can be used to support the        following:        -   BMS alert requests        -   Networking system alerts requests        -   BMS equipment requests        -   Networking system equipment requests.

The BMS 16 comprises an equipment database 18 containing the datarelating to the IDC equipment and an alarm and status database 20containing the data relative to IDC detectors which are responsive toset alarms. All the data contained in this database can be visualized inthe BMS. By this way the parameters of each detector are customizable.Each information point (IP) represents the address of a connected devicesuch as a smoke detector of a fire alarm system, a glass break sensor ofan intruder alarm system, or information from facilities that are partof the HVAC installation in a building such as heat meters or flowmeters. The IP definition determines how the message of the facilitysystem should be interpreted, e.g. conversion of the raw value inimpulse number into the actual unit of measurement in m3 of gas.

The data contained in the equipment database 18 and in the alarm andstatus database 20 are transmitted for visualization to the FMS 14respectively by data link 22 and data link 24.

Different data links could be used in the implementation such as thesynchronous link ODBC (standing for Open DataBase Connectivity), whichis a Microsoft strategic interface for accessing data in a heterogeneousenvironment of relational and non-relational database management systemsor an asynchronous link, e.g. MQ series.

The data link 22 between database 18 and, the FMS 14 collectsinformation related to the BMS equipment. For this, the informationrelating to the IP window mentioned above can be used since it providesthe address and the name of each detector. The data link 24 betweendatabase 20 and the FMS 14 collects information relating to the BMSalarms. For this, the information related to the IP alarm list can beused. Note that only alarms with state=ACTIVE and status=ALARM should beselected.

The OSS 12 of IT&N 10 includes several independent modules which areimportant to the invention. The modules are:

-   -   Inventory unit 26 which offers enterprise class inventory        management. This unit automatically scans for and collects        hardware and software configuration information from computer        systems in the customer's enterprise. At the same time, it is a        powerful tool for empowering help desk support technicians, and        can significantly reduce the costs of customer asset management        efforts.    -   Collecting data unit 28 is a tool for transforming raw data into        information. This information is presented in a variety of        graphical formats that can be viewed interactively (slice, dice,        drill down, or drill through for example). Using this unit,        graphical views of predefined data can be obtained.    -   Decision unit 30 manages the information system from a wide        variety of sources in a client/server environment. This unit        handles information (events) according to priorities previously        determined, correlates events from one source with those from        another source, applies decisions about those events through the        use of rules, takes corrective actions, and informs the right        people when required. It provides a centralized location for        managing events. It validates incoming events. When the database        entry for an event is created or modified, the event is analyzed        to see if any existing rules apply to the event.

The inventory unit 26 includes an equipment database and the decisionunit 30 includes an alarm and status database with the same functions asdatabases of BMS. These databases are linked by respectively an ODBClink 32 and an ODBC link 34 to FMS 14.

The Collecting Unit 28 provides a panel used to display graphical viewsof the BMS equipment thanks to an ODBC link 36 from the BMS equipmentdatabase 18. Different values related to BMS parameters are monitored.By this way, it is possible to correlate different information and toanalyze causes and consequences of environment trends.

First, the collecting unit chooses the representative BMS detectors. Foreach room of the IDC, one detector of each system of interest isselected. Thus, each room corresponds to a set of BMS detectorsreporting a set of values. Then, the values representing the behavior ofthe systems and the environment states are visualized. The userinterface of the collecting unit 28 is in general divided into threeparts: a Topic Map pane which is customized by defining the BMSparameters being chosen, a View pane for displaying the graphicsrepresenting the parameters such as temperature, humidity, and a Helperpane for displaying messages.

Another feature of the invention is a data link 42 between the alarm andstatus database 20 of the BMS 16 and the decision unit 30 of the IT&N10. Because BMS information may not always be understandable by theDecision Unit, it may be necessary to convert this information into IT&Ninformation by using an adapter such a program having both functions ofconverting and filtering. The function of filtering permits transmissionto the OSS of only the needed information insofar as only a part ofstatus and alarm information is ordinarily interesting. Furthermore, itis important that OSS 12 may regularly visualize the critical parametersof the BMS 16 related to the IT&N environment servers. Such a functionis achieved by a monitoring unit 38 in the OSS 12 which monitors the BMS16 via a line 40, whereas the information about BMS events is exchangedbetween database 20 (alarms and status) and decision unit 30 by means ofthe already mentioned TCP/IP link 42. Note that the event data beingexchanged may be either a predefined set of information or datacorresponding to one or several parameters associated with the eventwhich are out of a predetermined range of values (it is often the casewhen a predetermined threshold has been reached).

The processing of the BMS events by the decision unit 30 is achievedaccording to the flow chart illustrated in FIG. 2. The event beingclassified in several categories, the first step is to recognize whetheran event of the category A occurs (step 50). If so, a program A islaunched by an event server located in decision unit 30 (step 52). Notethat the event server recognizes each category of events and associatesa rule to be applied. Such a rule is of the type “what to do when thisevent is received”.

When a rule is applied because an event A has occurred, the first actionof the event server is to activate a variable in several files ofcritical situations wherein the event (defined by this variable) istaken into account (step 54). The files of critical situations impliedby the event A contain all the parameters defining this situation. Theaction of activating a variable may include incrementing a counter.

The second action of the event server is to verify whether the variableslisted in any one of N critical situations 1, 2 . . . N being affectedby event A are activated (step 56, 56′ . . . ). If, for a given criticalsituation, all the variables are being activated, another program isstarted (step 58, 58′ . . . ), such a program being associated with therequirements of the related Service Level Agreement (SLA). Then, the SLArequirements program chooses a set of actions using libraries, suchactions being either on BMS environment (step 60) and included in filestransmitted to the BMS, or on IT&N environment (step 62) performed byinternal commands. When a variable is activated, a time out counter maystart to increment. When all variables are not activated for a criticalsituation within the predefined “Time out” (step 64), it is consideredthat the variable is off and may be deactivated (step 66).

EXAMPLES

1. Uninterruptable Power Supply Failure

UPS are by definition managed by the BMS 16. If a problem is detectedthat could impact the server or network infrastructure, the IT&N 10 actsin order to choose an optimized solution related to the SLA or businessenvironment at this given time.

Without the integrated architecture, when a UPS is failing, the failingUPS status and reports are provided to the BMS 16 and sometimes to theIT&N 10. Actions are launched by the facility manager to repair the UPSproblem. Finally some decisions are taken by the IT manager to stop someservers.

With the integrated architecture according to the invention, the BMS 16collects UPS status and reports an environmental capability to the IT&N10 which analyses the required historical data related to such asituation including the immediate SLA requirements and predictiveelement lists to build a decision. The IT&N 10 transfers to BMS 16 theelements of the decision. The BMS 16 transfers to all the impactedenvironmental systems the new configuration to face the situation. Forexample, the BMS 16 receives an alarm concerning a UPS failure: “theback up lead time is low”. If nothing is done quickly, servers attachedto this UPS are at risk. Based on the decision set up by the IT&N 10 forthis failure, selected servers are shut down while others arereconnected by the BMS 16 to other UPS. The analysis of the problem(depending on criteria) establishes the urgency level that the IDCorganization must act on to resolve this failure. Back up functions canbe decided by the organization to recover a defined availability level.

2. IDC Maintenance Program

The FMS 14 launches work sheets concerning maintenance programs. Severalschedules are proposed in order to supervise preventive and correctiveactions on the equipment. Without the integrated architecture, thoseschedules are built on the maintenance timetable provided by eachequipment supplier.

With the integrated architecture according to the invention, the IT&N 10and the FMS 14 are connected to establish a general IDC maintenanceprogram taking into account the SLA requirements and the ITinfrastructure changes. SLA requirements are used to plan maintenancewindows out of the critical periods corresponding to the highavailability needs. It is thus possible to define at the same time anefficient workforce planning and an on time supply of the maintenanceparts to minimize the total cost of ownership.

In conclusion, the system according to the invention described hereabove provides architecture rules for a complete OSS based on a scalableplatform that allows important savings in time, money and energy. Thisplatform creates a single system and thus is the grouping of human andcomputing resources into the organization structure of the IDC. Actionsand decisions can be made more effectively and more rapidly by providingexchanges between the applications and therefore, reporting the rightinformation only to the right set of-persons managing the IDC, with aminimum set of control and decision mechanisms.

Furthermore, the system according to the invention allows treating themain operations at a global level while having dedicated people andorganization in charge of defined roles.

Such Groupware can correlate and visualize the interactions andinfluences between each decision and the global business objective ofthe IDC. The decisions taken and actions done will be the best ones forthe total IDC delivery objective with little or no impact on the IDCbusiness objective. Decision tools can be structured to automate many ofthe decisions while sending to the NOC the few crucial cases thatrequire human intervention.

1. A Management system for an Internet Data Center (IDC), comprising: anInformation Technology and Network Management System (IT&N) for managingsaid Internet Data Center; a Building Management System (BMS) formonitoring facilities related to said IDC; and a Facility ManagementSystem (FMS) for maintaining up to date geographical views andinformation about equipment, furniture and building space of said IDC;wherein said BMS is connected to said IT&N for transmitting event datafrom said BMS to said IT&N each time a predefined event occurs, saidIT&N comprises an Operational Support Service (OSS) for processing saidevent data, defining actions resulting from said processing, andtransmitting commands to said BMS and to said IT&N in response to thedefined actions; said BMS is configured to manage an uninterruptiblepower supply failure that could impact a server or networkinfrastructure; and said FMS is configured to launch work sheetsconcerning maintenance programs.
 2. The system according to claim 1,wherein occurrence of an event is achieved when at least one parameterassociated with said event is out of a predetermined range of values. 3.The system according to claim 1, wherein said OSS includes a decisionunit for handling said event data in accordance with predeterminedpriorities, correlating event data from one source with event data fromanother source, applying decisions about the predefined events by usingrules and taking appropriate corrective actions, and wherein thedecision unit is linked to the FMS.
 4. The system according to claim 3,wherein said BMS includes an alarm and status database wherein arestored said event data and wherein are stored data relative to IDCdetectors which are responsive to set alarms, said alarm and statusdatabase being linked to said decision unit by a TCP/IP link fortransmitting said event data thereto.
 5. The system according to claim4, wherein said BMS includes visualization means for visualizing theinformation stored in said alarm and status database.
 6. The systemaccording to claim 1, wherein said BMS includes an equipment database,said equipment database being linked to said FMS by data links forvisualizing information about said BMS.
 7. The system according to claim1, wherein said OSS includes an inventory unit for collecting hardwareand software configuration information from computer systems in acustomer's enterprise, and wherein the inventory unit is linked to theFMS.
 8. The system according to claim 7, wherein said inventory unitincludes a database linked to said FMS by data links for visualizingequipment data.
 9. The system according to claim 1, wherein said OSSincludes a collecting unit linked to an equipment database of said BMSby a data link displaying graphical views of BMS equipment comprised bysaid BMS.
 10. The system according to claim 1, wherein said OSS includesa monitoring unit linked to said BMS for regularly visualizing BMSparameters related to IT&N environment servers.
 11. The system accordingto claim 1, wherein said FMS includes a furniture management module fortracking furniture inventories, an equipment management module fortracking equipment inventories, a request system by Intranet module forproviding facilities to receive, route, delegate and manage employeeservice, equipment, new hire and move requests over an Intranet, and aspace management module for measuring assigned office space.
 12. Thesystem according to claim 7, wherein the inventory unit includes anequipment database with the same functions as an equipment database ofthe BMS.
 13. The system according to claim 4, wherein the decision unitincludes an alarm and status database with the same functions as thealarm and status database of the BMS.
 14. The system according to claim4, wherein said decision unit includes programs associated withcategories of events, a program of said programs associated with thecategory corresponding with said event data received from said alarm andstatus database being started upon receiving said event data in order toactivate a variable associated with said event data included in aplurality of lists of variables corresponding to a plurality ofsituations.
 15. The system according to claim 4, wherein said BMSfurther includes an equipment database, said equipment database and saidalarm and status database being linked to said FMS by data links forvisualizing information about said BMS.